278 research outputs found
A Task-based Evaluation of French Morphological Resources and Tools
Morphology is a key component for many Language Technology applications. However, morphological relations, especially those relying on the derivation and compounding processes, are often addressed in a superïŹcial manner. In this article, we focus on assessing the relevance of deep and motivated morphological knowledge in Natural Language Processing applications. We ïŹrst describe an annotation experiment whose goal is to evaluate the role of morphology for one task, namely Question Answering (QA). We then highlight the kind of linguistic knowledge that is necessary for this particular task and propose a qualitative analysis of morphological phenomena in order to identify the morphological processes that are most relevant. Based on this study, we perform an intrinsic evaluation of existing tools and resources for French morphology, in order to quantify their coverage. Our conclusions provide helpful insights for using and building appropriate morphological resources and tools that could have a signiïŹcant impact on the application performance
Le projet RESTAURE
National audienceLe projet Ressources InformatisĂ©es et traitement automatique pour les langues rĂ©gionales (RESTAURE) est un projet financĂ© par lâANR, entamĂ© au mois de janvier 2015 pour une durĂ©e de 42 mois. Il comporte trois objectifs principaux :âą acquisition et normalisation de ressources (corpus et lexiques) ;âą dĂ©veloppement dâoutils pour lâacquisition et lâanalyse de corpus ;âą diffusion des rĂ©sultats auprĂšs du grand public.Les langues rĂ©gionales de France concernĂ©es par le projet sont au nombre de trois : le picard, lâalsacien et lâoccitan. Chacune de ces langues est reprĂ©sentĂ©e par un laboratoire partenaire : LESCLAP Ă Amiens pour le picard, LiLPa Ă Strasbourg pour lâalsacien, et CLLE-ERSS Ă Toulouse pour lâoccitan. Ă cela sâajoute un laboratoire en rĂ©gion parisienne, le LIMSI-CNRS, qui travaille sur les aspects de traitement automatique des langues.La motivation principale du projet est le manque de ressources informatisĂ©es pour les langues rĂ©gionales de France, en particulier pour les trois langues concernĂ©es par le projet
Question Generation for French: Collating Parsers and Paraphrasing Questions
This article describes a question generation system for French. The transformation of declarative sentences into questions relies on two different syntactic parsers and named entity recognition tools. This makes it possible to further diversify the questions generated and to possibly alleviate the problems inherent to the analysis tools. The system also generates reformulations for the questions based on variations in the question words, inducing answers with different granularities, and nominalisations of action verbs. We evaluate the questions generated for sentences extracted from two different corpora: a corpus of newspaper articles used for the CLEF Question Answering evaluation campaign and a corpus of simplified online encyclopedia articles. The evaluation shows that the system is able to generate a majority of good and medium quality questions. We also present an original evaluation of the question generation system using the question analysis module of a question answering system
Es esch fĂ scht wie Ditsch, oder net? Ătiquetage morphosyntaxique de l'alsacien en passant par l'allemand
National audienceLâanalyse morphosyntaxique est une pierre angulaire de nombreuses applications du traitement automatique des langues. Elle nĂ©cessite toutefois des efforts de dĂ©veloppement consĂ©quents, quâil sâagisse dâannoter des corpus ou de produire des lexiques et des outils. Pour les langues moins dotĂ©es, il peut ĂȘtre utile dâexploiter leur proximitĂ© avec des langues disposant dâoutils et de ressources. Dans cet article, nous nous intĂ©ressons plus particuliĂšrement aux dialectes alsaciens, qui prĂ©sentent de nombreuses similitudes avec lâallemand standard. Nous montrons quâil est possible dâutiliser des outils dĂ©veloppĂ©s pour lâallemand afin de rĂ©aliser lâanalyse morphosyntaxique de textes en alsacien. La mĂ©thode consiste Ă transposer les mots outils des textes alsaciens vers leurs Ă©quivalents en allemand standard. Cette transposition nĂ©cessite pour seule ressource un lexiquebilingue des mots outils
De-Confinement in high multiplicity proton-proton collisions at LHC energies
Recently, the CMS Collaboration has published identified particle transverse
momentum spectra in high multiplicity events at LHC energies =
0.9-13 TeV. In the present work the transverse momentum spectra have been
analyzed in the framework of the color fields inside the clusters of
overlapping strings, which are produced in high energy hadronic collisions. The
non-Abelian nature is reflected in the coherence sum of the color fields which
as a consequence gives rise to an enhancement of the transverse momentum and a
suppression of the multiplicities relative to the non overlapping strings.
The initial temperature and shear viscosity to entropy density ratio
are obtained. For the higher multiplicity events at =7 and 13 TeV
the initial temperature is above the universal hadronization temperature and is
consistent with the creation of de-confined matter. In these small systems it
can be argued that the thermalization is a consequence of the quantum tunneling
through the event horizon introduced by the confining color fields, in analogy
to the Hawking-Unruh effect. The small shear viscosity to entropy density ratio
near the critical temperature suggests that the matter is a strongly
coupled Quark Gluon Plasma.Comment: 5 pages, 4 figure
Building an ontology of cardio-vascular diseases for concept-based information retrieval
International audienc
Approches à base de fréquences pour la simplification lexicale
National audienceLa simplification lexicale consiste Ă remplacer des mots ou des phrases par leur Ă©quivalent plus simple. Dans cet article, nous prĂ©sentons trois modĂšles de simplification lexicale, fondĂ©s sur diffĂ©rents critĂšres qui font qu'un mot est plus simple Ă lire et Ă comprendre qu'un autre. Nous avons testĂ© diffĂ©rentes tailles de contextes autour du mot Ă©tudiĂ© : absence de contexte avec un modĂšle fondĂ© sur des frĂ©quences de termes dans un corpus d'anglais simplifiĂ© ; quelques mots de contexte au moyen de probabilitĂ©s Ă base de n-grammes issus de donnĂ©es du web ; et le contexte Ă©tendu avec un modĂšle fondĂ© sur les frĂ©quences de cooccurrences. ABSTRACT Studying frequency-based approaches to process lexical simplification Lexical simplification aims at replacing words or phrases by simpler equivalents. In this paper, we present three models for lexical simplification, focusing on the criteria that make one word simpler to read and understand than another. We tested different contexts of the considered word : no context, with a model based on word frequencies in a simplified English corpus ; a few words context, with n-grams probabilites on Web data, and an extended context, with a model based on co-occurrence frequencies. MOTS-CLĂS : simplification lexicale, frĂ©quence lexicale, modĂšle de langue
Fatal Mycobacterium tuberculosis infection despite adaptive immune response in the absence of MyD88
Toll-like receptors (TLRs) such as TLR2 and TLR4 have been implicated in host response to mycobacterial infection. Here, mice deficient in the TLR adaptor molecule myeloid differentiation factor 88 (MyD88) were infected with Mycobacterium tuberculosis (MTB). While primary MyD88â/â macrophages and DCs are defective in TNF, IL-12, and NO production in response to mycobacterial stimulation, the upregulation of costimulatory molecules CD40 and CD86 is unaffected. Aerogenic infection of MyD88â/â mice with MTB is lethal within 4 weeks with 2 log10 higher CFU in the lung; high pulmonary levels of cytokines and chemokines; and acute, necrotic pneumonia, despite a normal T cell response with IFN-Îł production to mycobacterial antigens upon ex vivo restimulation. Vaccination with Mycobacterium bovis bacillus Calmette-GuĂ©rin conferred a substantial protection in MyD88â/â mice from acute MTB infection. These data demonstrate that MyD88 signaling is dispensable to raise an acquired immune response to MTB. Nonetheless, this acquired immune response is not sufficient to compensate for the profound innate immune defect and the inability of MyD88â/â mice to control MTB infection
- âŠ